NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Position: Topological Deep Learning is the New Frontier for Relational Learning

Papamarkou, Theodore; Birdal, Tolga; Bronstein, Michael; Carlsson, Gunnar; Curry, Justin; Gao, Yue; Hajij, Mustafa; Kwitt, Roland; Lio, Pietro; DiLorenzo, Paolo; et al (July 2024, Proceedings of the 41st International Conference on Machine Learning (ICML))

Full Text Available
Position: Topological Deep Learning is the New Frontier for Relational Learning

Papamarkou, Theodore; Birdal, Tolga; Bronstein, Michael M; Carlsson, Gunnar E; Curry, Justin; Gao, Yue; Hajij, Mustafa; Kwitt, Roland; Lio, Pietro; Lorenzo, Paolo_Di; et al (July 2024, Proceedings of the 41st International Conference on Machine Learning (ICML), 2024.)

Full Text Available
Position: Topological Deep Learning is the New Frontier for Relational Learning

Papamarkou, Theodore; Birdal, Tolga; Bronstein, Micheal; Carlsson, Gunnar; Curry, Justin; Gao, Yue; Hajij, Mustafa; Kwitt, Roland; Lio, Pietro Pietro; Di_Lorenzo, Paolo; et al (June 2024, ICML: https://openreview.net/pdf?id=Nl3RG5XWAt)

Full Text Available
Position: Topological Deep Learning is the New Frontier for Relational Learning.

Papamarkou, Theodore; Birdal, Tolga; Bronstein, Michael M; Carlsson, Gunnar E; Curry, Justin; Gao, Yue; Hajij, Mustafa; Kwitt, Roland; Lio, Pietro; Di_Lorenzo, Paolo; et al (July 2024, International Conference on Machine Learning 2024 (ICML).)

Full Text Available
HuMoR: 3D Human Motion Model for Robust Pose Estimation

https://doi.org/10.1109/ICCV48922.2021.01129

Rempe, Davis; Birdal, Tolga; Hertzmann, Aaron; Yang, Jimei; Sridhar, Srinath; Guibas, Leonidas J. (October 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV))

We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose and shape. Though substantial progress has been made in estimating 3D human motion and shape from dynamic observations, recovering plausible pose sequences in the presence of noise and occlusions remains a challenge. For this purpose, we propose an expressive generative model in the form of a conditional variational autoencoder, which learns a distribution of the change in pose at each step of a motion sequence. Furthermore, we introduce a flexible optimization-based approach that leverages HuMoR as a motion prior to robustly estimate plausible pose and shape from ambiguous observations. Through extensive evaluations, we demonstrate that our model generalizes to diverse motions and body shapes after training on a large motion capture dataset, and enables motion reconstruction from multiple input modalities including 3D keypoints and RGB(-D) videos. See the project page at geometry.stanford.edu/projects/humor.
more » « less
Full Text Available
Deformation-Aware 3D Model Embedding and Retrieval

Uy, Mikaela Angelina; Huang, Jingwei; Sung, Minhyuk; Birdal, Tolga; Guibas, Leonidas (November 2020, European Conference on Computer Vision)
null (Ed.)
We introduce a new problem of retrieving 3D models that are deformable to a given query shape and present a novel deep deformation-aware embedding to solve this retrieval task. 3D model retrieval is a fundamental operation for recovering a clean and complete 3D model from a noisy and partial 3D scan. However, given a finite collection of 3D shapes, even the closest model to a query may not be satisfactory. This motivates us to apply 3D model deformation techniques to adapt the retrieved model so as to better fit the query. Yet, certain restrictions are enforced in most 3D deformation techniques to preserve important features of the original model that prevent a perfect fitting of the deformed model to the query. This gap between the deformed model and the query induces asymmetric relationships among the models, which cannot be handled by typical metric learning techniques. Thus, to retrieve the best models for fitting, we propose a novel deep embedding approach that learns the asymmetric relationships by leveraging location-dependent egocentric distance fields. We also propose two strategies for training the embedding network. We demonstrate that both of these approaches outperform other baselines in our experiments with both synthetic and real data. Our project page can be found at deformscan2cad.github.io.
more » « less
Full Text Available
MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Huang, Jiahui; Wang, He; Birdal, Tolga; Sung, Minhyuk; Arrigoni, Federica; Hu, Shi-Min; Guibas, Leonidas (January 2021, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition)
null (Ed.)
We present MultiBodySync, a novel, end-to-end trainable multi-body motion segmentation and rigid registration framework for multiple input 3D point clouds. The two non-trivial challenges posed by this multi-scan multibody setting that we investigate are: (i) guaranteeing correspondence and segmentation consistency across multiple input point clouds capturing different spatial arrangements of bodies or body parts; and (ii) obtaining robust motion-based rigid body segmentation applicable to novel object categories. We propose an approach to address these issues that incorporates spectral synchronization into an iterative deep declarative network, so as to simultaneously recover consistent correspondences as well as motion segmentation. At the same time, by explicitly disentangling the correspondence and motion segmentation estimation modules, we achieve strong generalizability across different object categories. Our extensive evaluations demonstrate that our method is effective on various datasets ranging from rigid parts in articulated objects to individually moving objects in a 3D scene, be it single-view or full point clouds.
more » « less
Full Text Available
Quaternion Equivariant Capsule Networks for 3D Point Clouds

Zhao, Yongheng; Birdal, Tolga; Lenssen, Jan Eric; Menegatti, Emanuele; Guibas, Leonidas; Tombari, Federico (November 2020, European Conference on Computer Vision)
null (Ed.)
We present a 3D capsule module for processing point clouds that is equivariant to 3D rotations and translations, as well as invariant to permutations of the input points. The operator receives a sparse set of local reference frames, computed from an input point cloud and establishes end-to-end transformation equivariance through a novel dynamic routing procedure on quaternions. Further, we theoretically connect dynamic routing between capsules to the well-known Weiszfeld algorithm, a scheme for solving iterative re-weighted least squares (IRLS) problems with provable convergence properties. It is shown that such group dynamic routing can be interpreted as robust IRLS rotation averaging on capsule votes, where information is routed based on the final inlier scores. Based on our operator, we build a capsule network that disentangles geometry from pose, paving the way for more informative descriptors and a structured latent space. Our architecture allows joint object classification and orientation estimation without explicit supervision of rotations. We validate our algorithm empirically on common benchmark datasets.
more » « less
Full Text Available
Quaternion Equivariant Capsule Networks for 3D Point Clouds

https://doi.org/10.1007/978-3-030-58452-8_1

Zhao, Yongheng; Birdal, Tolga; Lenssen, Jan Eric; Menegatti, Emanuele; Guibas, Leonidas; Tombari, Federico (November 2020, European Conference on Computer Vision)
null (Ed.)
We present a 3D capsule module for processing point clouds that is equivariant to 3D rotations and translations, as well as invariant to permutations of the input points. The operator receives a sparse set of local reference frames, computed from an input point cloud and establishes end-to-end transformation equivariance through a novel dynamic routing procedure on quaternions. Further, we theoretically connect dynamic routing between capsules to the well-known Weiszfeld algorithm, a scheme for solving iterative re-weighted least squares (IRLS) problems with provable convergence properties. It is shown that such group dynamic routing can be interpreted as robust IRLS rotation averaging on capsule votes, where information is routed based on the final inlier scores. Based on our operator, we build a capsule network that disentangles geometry from pose, paving the way for more informative descriptors and a structured latent space. Our architecture allows joint object classification and orientation estimation without explicit supervision of rotations. We validate our algorithm empirically on common benchmark datasets.
more » « less
Full Text Available
MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

Huang, Jiahui; Wang, He; Birdal, Tolga; Sung, Minhyuk; Arrigoni, Federica; Hu, Shi-Min; Guibas, Leonidas J (January 2021, IEEE Conference on Computer Vision and Pattern Recognition)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records